21.3 Beyond Sequence
303
To preserve spatial information in a tissue, in situ hybridization can be used to
obtain transcriptomic information. 7 This is called single-molecule fluorescence in
situ hybridization (smFISH), and many variants are being developed.
21.2
Applications to Disease and Other Phenomena
A good illustration of the power of multiomics is the decoding of the molecular pro-
cesses underlying circadian rhythms in green algae. 8 Understanding these processes
would scarcely have been possible without the parallel examination of genomic,
transcriptomic of metabolic data. Sharifi-Noghabi et al.2019 have developed useful
techniques for integrating such data. 9
The chromosomal disorder characteristic of cancer (Sect. 14.5) makes single-cell
examination of tumours an especially valuable route to obtaining insight into their
formation and development.
A tensor-based association test seems to be a useful for discovering novel disease
genes or mechanisms from multiomics data. 10
21.3
Beyond Sequence
Proteomics data (see Chaps. 18 and 23) are integrated with sequence information in
the attempt to assign function. Proteins whose mRNA levels are correlated with each
other, proteins whose homologues are fused into a single gene in some organisms,
those which have evolved in a correlated fashion, those whose homologues operate
together in a metabolic path or that are known to physically interact can all be con-
sidered to be linked in some way; for example, a protein of unknown function whose
expression profile (see Footnote 7 in Chap. 18) matches that of a protein of known
function in another organism is assigned the same function. In a literary analogy,
one could rank the frequencies of words in an unknown and known language and
assign the same meanings to the same ranks. Whether the syntax of gene expression
is sufficiently shared by all organisms to allow this to be done reliably is still an open
question. One can, however, probably assume that this syntax is shared by all cells
from the same organism.
Other kinds of data assisting protein function prediction are structure prediction,
intracellular localization, signal peptide cleavage sites of secreted proteins, glyco-
sylation sites, lipidation sites, phosphorylation sites, other sites for posttranslational
modification, cofactor binding sites, dehydron density, and so on.
7 Lubeck and Cai (2012).
8 Strenkert et al. (2019).
9 See also Lee et al. (2020) for a comprehensive review.
10 Chang et al. (2001).